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2. Title of Invention Method for reducing system boot time bv retaining data used during the initialization sequence in a non~ 
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Boxcar 
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Please attach a description of the invention to this form, DATED AND SIGNED BY AT LEAST 
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1. Describe in detail what the components of the invention are and how the 
invention works. 
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3. YOU MUST include at least one figure illustrating the invention. If the 
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representation of the algorithm. 

4. Value of your invention to Intel (how will it be used?). 

5. Explain how your invention is novel. If the technology itself is not new, 
explain what makes it different 

6. Identify the closest or most pertinent prior art that you are aware of. 
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*HAVE YOUR SUPERVISOR READ, DATE AND SIGN COMPLETED FORM 



DATE: 



4^- 



SUPERVISOR 




BY THIS SIGNING. I (SUPERVISOR) ACKNOWLEDGE THAT I HAVE READ AND UNDERSTAND THIS 
DISCLOSURE, AND RECOMMEND THAT THE HONORARIUM BE PAID 



December 2000 



Page 3 



REV. 14 




INTEL CONFIDENTIAL 

Background * 

Intel, through CBD, is making/ has made a series of investments in technologies and companies with 
technologies that have potential to be "disruptive" technologies with respect to mass storage (Boxcar, Black 
Butte for instance). The Boxcar investment also includes significant seed money. These technologies are 
aimed at the performance of mass storage and the ability of mass storage to "scale" to smaller sizes for things 
like basic PCs and appliances. The goal is to make the performance of mass storage in PC and Intel based 
appliances much better, as well as developing (or at least preserving the option of developing) a significant 
new business line. Since we are contemplating a new business, developing a body of IP is important. Over the 
next year to 18 months, the technologies will reach the prototype stage. Initially, it is likely that these 
technologies are best deployed as very large (on the order of 1 Gbyte) non-volatile caches for mass storage. 
We are starting the architecture of prototype devices (caches) that would take advantage of these 
technologies. This work is resulting in a series of invention disclosures covering the various ways to implement 
these caches. 

The Invention : 

This invention specifies a method for retaining in the non-volatile storage array the data expected to be 
needed during system initialization or "reboot". The sequence of disk accesses required to load the operating 
system and launch system services is predictable and should be a small fraction of the total non-volatile 
storage array. Normal operation would cause this initialization data to be brought into the cache, but over time 
to be evicted as new data is brought into the cache Therefore, this invention is to let data accessed during 
initialization be loaded into the cache as normal and then marked (pinned) to prevent eviction. Even reboots 
following an unexpected system shutdown would find the data necessary resident in the cache thus avoiding 
costly disk accesses. An upper bound on the amount of data pinned into the cache in this way would be 
enforced to limit the amount of space occupied during normal execution. 

The following figure illustrates the hardware context for the invention. A non-volatile storage array (could be 
flash, polymer memory, probe storage, etc.) with block oriented ECC support, support for write back of 
destructive reads (required for some of the technologies) waveform generation (some technologies have 
unusual requirements) etc. is part of an add-in PCI card, for example, (other embodiments beyond PCI are 
taught in other disclosures already submitted and now patent applications. P8059. 8060 ,8062, 8063) 




PCI add-in 
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The pinning of data would be accomplished with the addition of a "pinned bit" to the meta-data state for each 
cache line. We distinguish between meta-data required for correct operation and meta-data helpful for 
improving performance but not required for correct operation. The meta-data required for correct operation 
includes an isValid bit, an isDirty bit and a tag indicating the starting disk address for the data contained in the 
cache line. This meta-data is kept both in memory and in the non-volatile cache. The meta-data that is not 
required for correct operation but that is helpful for improving performance is kept only in memory. This meta- 
data includes the ape of each cache line for use in the LRU replacement algorithm. The "pinned bit" required 
to support this invention would be placed in this non-persistent meta-data as well. (Note: The use of the term 
non-persistent meta-data does not imply that it will never be stored in the non-volatile cache. We could 
choose to store it there in order to preserve information between reboots of the machine. ) 

Persistent Meta-Data 
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Non-persistent Meta-Data 


Pinned 


Cached 


Age 




Before 






Reboot 





We also add a "cachedBeforeReboot bit" to the non-persistent meta-data. This bit will be set for any cacheline 
that was present in the non-volatile cache when the machine was restarted. (These bits will actually be set as 
the persistent meta-data is paged in from the non-volatile cache. 

On reboot, we expect to find data pinned during the last initialization sequence to be already loaded into the 
cache. However, with the pinned bit stored only in memory, we would not know whether an individual cache 
line was pinned or whether it was simply still in the cache. 

During each initialization sequence, we can choose which data to pinn. We need to balance the desire to 
improve reboot performance with the desire to keep most of the cache free during normal operation. 
Therefore, we limit the number of lines per set that may be pinned. 

Upon initialization, we set a timer . Until this timer expires, we will consider pinning data into the cache in order 
to improve the performance of the next reboot. During this time when a cache line is accessed, we look at the 
meta-data for all lines in the corresponding set in the cache. If we have not yet exceeded the maximum 
number of pinned lines in that set, then we set the pinned bit. If we have already pinned the maximum number 
of lines, then we look at the cachedBeforeReboot bits. If any of the pinned lines where not 
cachedBeforeReboot and the current line was, then we discard a line that does not have the 
cachedBeforeReboot bit set. If there are more than one, we discard the line with the latest age. If neither the 
new line nor the currently pinned lines were cachedBeforeReboot, then bring the new line into the cache but 
do not pinn it. 

This algorithm gives preference to pinning lines that were already in the cache - thus having demonstrated 
their usefulness at least twice. The algorithm also gives preference to lines with earlier access times in order 
to capture the true initialization sequence and not early user activity. 

We suggest that the maximum number of pinned lines per set be set to 1 to preserve the majority of each set 
for normal operation. We suggest that the timer be set to 2 minutes. In setting the timer, we suggest that it is 
better to allow a generous amount of time for the initialization sequence and rely on the above algorithm to 
make the appropriate trade-offs to limit impact on normal operation. 
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The time required to reload the operating system and restart system services is a very visible source of 
irritation to users. Much of this time is devoted to reading necessary data from disk. The sequence of disk 
blocks read during start-up is quite predictable on a given machine. Therefore, the initialization time could be 
reduced if the necssary blocks where already loaded into a non-volatile cache. 

The unique aspects of the invention 

Pinning data into a cache is not unique. However, we are not aware of other systems that pin data into a non- 
volatile cache, and no systems that pin data in order to reduce the time needed for next restart. We are aware 
of no mass storage cache, volatile or non-volatile, that pins data 

Prior art : 

Many operating systems provide an interface by which programmers can specify that certain portions of the 
address space of their process should be pinned into memory. Operating systems may also pin certain data 
into memory at will . Sometimes this is done to improve performance of the file system or virtual memory 
system. Sometimes this is required for correctness (e.g. device drivers that need to have certain pages in 
memory at all time). 

Likely Infringers and Detection : 

Any other company deciding to produce a non-volatile mass storage cache would be a likely infringer. To 
detect infringment, the initialization sequence of a device with the non-volatile cache should be analyzed. If 
accesses consistently hit in the cache during the initialization sequence, infringement should be suspected. 
Access to source code specifying the eviction polices for the cache would be the easiest way to prove 
infringement 
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